Implications of glottal source for speaker and dialect identification
نویسندگان
چکیده
In this paper we explore the importance of speaker specific information carried in the glottal source. We time align utterances of two speakers speaking the same sentence from the TIMIT database of American English. We then extract the glottal flow derivative from each speaker and interchange them. Through time alignment and this glottal flow transformation, we can make a speaker of a northern dialect sound more like his southern counterpart. We also time align the utterances of two speakers of Spanish dialects speaking the same sentence and then perform the glottal waveform transformation. Through these processes a Peruvian speaker is made to sound more Cuban-like. From these experiments we conclude that significant speaker and dialect specific information, such as noise, breathiness or aspiration, and vocalization, is carried in the glottal signal.
منابع مشابه
Assimilation of Final Low Back Vowel in Eghlidian Dialect
In this article, the low back vowel /A/ in word-final positions in Eghlidian dialect, one of Persian dialects, is studied. This vowel is represented phonetically as [A], [o] and [@] in different phonetic environments. Therefore many words were collected via interviewing ten native speakers so that these different alternant forms can be accounted for appropriately. Since one of the authors of th...
متن کاملHow consonants, dialect and speech rate affect vowel devoicing?
We examined the glottal opening pattern during devoicing environment in Japanese, with respect to the factors that facilitate or suppress devoicing. The factors include consonantal environment, dialects, speech rate, consecutive devoicing environment and phrase final position. The results indicated that glottal opening patterns are twofold: a single phaseand a double phase opening for /CVC/. On...
متن کاملSpeaker Identification Using Glottal-Source Waveforms and Support-Vector-Machine Modelling
Speaker identification experiments are performed with novel features representative of the glottal source waveform. These are derived from closed-phase analysis and inverse filtering. Source waveforms are segmented into two consecutive periods and normalised in prosody, forming so called source-frame feature vectors. Support-vector-machines are used to construct speaker discriminative hyperplan...
متن کاملModeling of the glottal flow derivative waveform with application to speaker identification
Speech production has long been viewed as a linear filtering process, as described by Fant in the late 1950's [10]. The vocal tract, which acts as the filter, is the primary focus of most speech work. This thesis develops a method for estimating the source of speech, the glottal flow derivative. Models are proposed for the coarse and fine structure of the glottal flow derivative, accounting for...
متن کاملThe Status of [h] and [ʔ] in the Sistani Dialect of Miyankangi
The purpose of this article is to determine the phonemic status of [h] and [ʔ] in the Sistani dialect of Miyankangi. Auditory tests applied to the relevant data show that [ʔ] occurs mainly in word-initial position, where it stands in free variation with Ø. The only place where [h] is heard is in Arabic and Persian loanwords, and only in the pronunciation of some speakers who are educated and/or...
متن کامل